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establishing for each molecule a respective description, by comparison of the molecule's 
molecular structure to i set of molecular substructure keys; 

grouping the molecules based on similarity of their respective descriptions, so as to define 
groups of structurally similar molecules; 

selecting at le&:t one ocfthe groups of structurally similar molecules based on an extent to 
which the molecules in the selected group have activity characteristics reflecting ihe given 

activity; 

for each of the at least one selected group, identifying at least one molecular feature set 
common to all of the n.olecules in the selected group; and 

outputting data indicative of at least one identified molecular feature set. 

70. (New) The computerized method of claim 69, wherein grouping the molecules 
based on similarity of iheir respective descriptions comprises: 

applying a clus ering algorithm to cluster the molecules based on their descriptions. 

/ 1 . (jNewj i he computerized method of claim 70, wherein the clustering algorithm 
comprises SOM clusteing. 

72. (New) The computerized method of claim 69, wherein selecting at least one of 
the groups based on an extent to which the molecules in the selected group have activity 

Electing a group m response lo me group containing r»r joast ?. predetermined number oi 
molecules m the group that have activity characteristics reflecting the given activity. 
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73, (New) The computerized method of claim 69, wherein selecting at least one of 
the groups based on an extent to which the molecules in the selected group have activity 
characteristics reflectirig the given activity comprises: 

selecting a gro ip in response to at least a predetermined percent of the molecules in the 
group having an activi y characteristic reflecting the given activity. 



74. (New) The computerized method of claim 69, wherein identifying at least one 
molecular feature set common to all of the molecules in the selected group comprises identifying 
a maximum common substructure of the molecules in the selected group. 



75. (New) The computerized method of claim 74, wherein identifying a maximum 
common substructure of the molecules in the selected group comprises applying subgraph 
isomorphism to compare the descriptions of the molecules in the selected group. 



CWew ; i ne computenzed method of claim 74, wherein identifying a maximum 
common substructure of the molecules in the selected group comprises applying a genetic 
algorithm to compare ihe descriptions of the molecules in the selected group. 



77. (New) The computerized method of claim 69, wherein the molecular feature set 
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78. (New) The computerized method of claim 69, wherein the molecular feature set 
common to all of th< molecules in the selected group is a non-contiguous combination of 
molecular features. 

79. (New) The computerized method of claim 69, wherein identifying at least one 
molecular feature set common to al! of the molecules in selected group comprises: 

identifying a plurality of molecular feature sets each common to all of the molecules in 
the selected group. 

80. (New) The computerized method of claim 79, wherein outputting data 
indicative of at least one identified molecular feature set comprises: 

outputting data indicative of the plurality of molecular feature sets. 

8L (New) A computer-readable medium containing a set of machine language 
instructions executable by a computer to carry out the method of claim 69. 



82. (New) A machine programmed with machine language instructions executable 
by a processor to carry out the method of claim 69. 

83. (New) A method of identifying a molecular feature set likely to be responsible 

iccciving into . computer a set or input data that represents molecules mid that defines, 
respectively for each irolecule, a molecular structure and an activity characteristic; 
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operating the computer to establish for each molecule a respective description vector, by 
comparison of the molecule's molecular structure to a set of molecular substructure keys; 

operating the computer to apply a clustering algorithm so as to sort the molecules into 
groups based on similarity of their respective description vectors; 

operating the computer to select at least one of the groups based on an extent to which the 
molecules in the selected group have activity characteristics reflecting the given activity; 

operating the computer to identify, for each of the at least one selected group, a maximum 
common substructure of the molecules in the selected group; and 

outputting from the computer data indicative of at least one identified molecular feature 

set. 

84. (New) A computerized method of identi lying a molecular feature set likely to 
be responsible for a given activity, based on a set of input data that represents molecules and that 
defines respectively for each molecule a molecular structure and an activity characteristic, the 
method comprising: 

(a) establishing for each molecule a respective description, by comparison of the 
molecule's molecular structure to a set of molecular substructure keys; 

(b) groupirg the molecules based on similarity of their respective descriptions, so as 
to define groups of structurally similar molecules; 

(c) selecting at least one of the groups of structurally similar molecules based on an 

.riven acnvit\ . 
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(d) for eacli of the at least one selected group, identifying at least one molecular 
feature set common to all of the molecules in selected group; 

(e) adding it least one identified molecular feature set as a new substructure key in 
the set of molecular substructure keys> and then repeating elements (a) through (d); and 

(f) outputthg data indicative of at least one identified molecular feature set. 



85. (New) The method of claim 84, wherein outputting data indicative of at least 
one identified molecular feature set comprises: 

determining wliich identified molecular feature set has the most atoms, and outputting 
data indicative of that molecular feature set. 

86. (Ney) The method of claim 84, wherein grouping the molecules based on 
similarity of their respective descriptions comprises SOM clustering the molecules based on their 
respective descriptions 

87. (New) The method of claim 86, wherein SOM clustering the molecules results 
in a SOM grid reflecting clusters of molecules, and wherein outputting data indicative of at least 
one identified moIecuLir feature set comprises; 

outputting a screen display that depicts contents of the SOM grid. 
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89. (New) A machine programmed with program instructions executable by a 
processor to carry out the method of claim 84. 

90, (N6w) A processing system for modeling chemical structure-activity 
relationships through artificial intelligence analysis of an input data set representing molecules, 
each of the molecules having a set of features and an activity characteristic, the processing 
system comprising, in vMnhination* 

means for establishing for each molecule a respective description, by comparison of the 
molecule's molecular structure to a set of molecular substructure keys; 

means for grouping the molecules based on similarity of their respective descriptions, so 
as to define groups of structurally similar molecules; 

means for selecting ax least one of the groups of structurally similar molecules based on 
an extent to which the molecules in the selected group have activity characteristics reflecting the 
given activity; 

means for identifying at leaftt one molecular feature set common to all of the molecules in 
each of at least one sel ected group; and 

means for outp atting data indicative of at least one identified molecular feature set, 

Remarks 

In the Office Action mailed July 16, 2001, the Examiner rejected claims 22-38, 63-66 and 
b> Hibert et a!. "Graphics Computer Aided Receptor Mapping as a Predictive fwol ;br Drik. 
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